Abstract

Several procedures proposed in the literature for the analysis of growth
curves are reviewed. Particular attention is given to the current issues in
this area to guide practitioners in the selection of the most appropriate
methodology.

1. Introduction 

In many experimental situations, especially longitudinal growth curve
studies, data are collected on several variates and a subject is observed on
each of the variates over time. Designs with multivariate observations on $p_0$
variates observed over q time points are often referred to as multi-response
repeated measures or growth curve designs.

To analyze multi-response growth curve data obtained on N subjects, the
data are conveniently arranged in a data matrix $Y_0$ $(N \times p_0 q)$ where the first q
columns correspond to variable 1, the next q to variable 2, the next q to
variable 3, and so on up to the $p_0$-th variable.

Given a data matrix $Y_0$ with $N_i$ subjects in the i-th of m groups, $\sum_{i=1}^{m} N_i = N$,
suppose for simplicity that three measures are recorded on each subject at q time
points, so that associated with each subject are 3q measurements, all correlated
with unknown variance-covariance matrix $\Sigma_0$. The i-th growth curve for each of
the three multivariate responses may be represented by
(1.1)  [equation illegible in the source]
Employing matrix notation with the matrices B and P defined:
(1.1) may be represented by the matrix product BP. The matrix P given above
has been represented as a super diagonal Vandermonde matrix; alternatively, we
could have used unnormalized or normalized orthogonal polynomials. Letting
X $(N \times m)$ denote a design matrix, the growth curve model is represented as

(1.2)  $E(Y_0) = XBP, \qquad V(Y_0) = I_N \otimes \Sigma_0 .$
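As a minimal sketch of how the post matrix P and the mean structure XBP fit together, the following Python fragment builds a Vandermonde P for a single response ($p_0 = 1$) and evaluates XBP. The time points, group layout and coefficient values are hypothetical and chosen only for illustration; the helper names are not from the paper.

```python
def vandermonde_P(times, p):
    """Return the p x q matrix whose (k, j) entry is times[j] ** k."""
    return [[t ** k for t in times] for k in range(p)]


def matmul(A, B):
    """Plain-Python matrix product of two lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)]
            for row in A]


# Hypothetical setup: q = 4 time points, linear growth (p = 2),
# N = 3 subjects in m = 2 groups.
times = [1, 2, 3, 4]
P = vandermonde_P(times, 2)        # rows: t**0 and t**1
X = [[1, 0], [1, 0], [0, 1]]       # group-membership design matrix
B = [[10.0, 2.0], [8.0, 3.0]]      # (intercept, slope) for each group

mean = matmul(matmul(X, B), P)     # E(Y_0) = X B P, an N x q matrix
for row in mean:
    print(row)
```

Each row of `mean` is the expected growth curve of one subject evaluated at the q time points, so subjects in the same group share the same row.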

Representing the growth curve model (GCM) as (1.2), some common multi-response
hypotheses of the form

(1.3)  $H_0: CBA = \Gamma$

where C $(g \times m)$ of full rank g, A $(p_0 q \times u)$ of full rank u, and $\Gamma$ $(g \times u)$ are
known matrices, may be tested.

To analyze growth curve data on one variate ($p_0 = 1$) observed at q time
points using (1.2), most authors reduce the model to a standard MANOVA or MANCOVA
model. How this reduction is carried out is the major topic of this paper. From
the representation of B and P, whether $p_0 = 1$ or $p_0 > 1$, the extension to more
than one variate is immediate.



2. Standard MANOVA Model 

In a multivariate experiment involving N subjects on which p measurements
are observed, the data obtained may be represented by an $(N \times p)$ data matrix Y.
Assuming the standard MANOVA model (SMM) describes the experiment, the model for
the random matrix Y is denoted as follows.

(2.1)  $E(Y) = XB, \qquad V(Y) = I_N \otimes \Sigma$

The matrix X $(N \times m)$ is a known design matrix of rank $r \le m < N$, B $(m \times p)$ is a
matrix of unknown nonrandom parameters and $\Sigma$ $(p \times p)$ is the positive definite (p.d.)
variance-covariance matrix of any p-variate row vector of Y.



In the SMM, we are usually interested in estimating linear parametric functions
of the form $\psi = c'Ba$ and sets of functions of the form CBA, where c $(m \times 1)$,
a $(p \times 1)$, C $(g \times m)$ and A $(p \times u)$ are known, $R(C) = g$ and $R(A) = u$.
Assuming $\psi$ is estimable, c belongs to the space spanned by $X'X$ or, equivalently,
$c' = c'H$ where $H = (X'X)^{-}(X'X)$. Rao (1973) and Roy (1975) have shown that the
best linear unbiased estimator (BLUE) of $\psi = c'Ba$, when it is estimable, is
given by

(2.2)  $\hat\psi = c'\hat{B}a, \qquad \hat{B} = (X'X)^{-}X'Y .$

The variance of $\hat\psi$ is

$(a'\Sigma a)\,(c'(X'X)^{-}c)$

(see, e.g., Timm, 1975, Section 3.6). Considering the linear set CBA, Roy (1964)
showed that if $CB_0A$ is any unbiased estimator of CBA other than $C\hat{B}A$, then

$V(CB_0A) - V(C\hat{B}A)$ is positive semidefinite (p.s.d.)
$\mathrm{ch}_{\max}[V(CB_0A)] \ge \mathrm{ch}_{\max}[V(C\hat{B}A)]$
$\mathrm{Tr}[V(CB_0A)] > \mathrm{Tr}[V(C\hat{B}A)]$
$|V(CB_0A)| > |V(C\hat{B}A)|$

where $V(C\hat{B}A) = C(X'X)^{-}C' \otimes A'\Sigma A$.

Hence, $\hat\Psi = C\hat{B}A$ is a unique solution in terms of the minimization of the trace
criterion and the generalized variance criterion because of the strict inequality.
Although $\hat\Psi$ yields a minimum in terms of the maximum root criterion, the solution
is not unique.

The estimation of $\Psi$ under (2.1), the multivariate Gauss-Markoff setup, does
not require distributional assumptions about the row vectors of Y. If, however,
we assume that the rows of Y are independently normally distributed, then
maximum likelihood estimators of the functions of B may be obtained. Writing the
likelihood function as

(2.4)  $L = (2\pi)^{-Np/2}\,|\Sigma|^{-N/2}\exp\{-\tfrac{1}{2}\mathrm{Tr}[\Sigma^{-1}(Y - XB)'(Y - XB)]\}$

and solving the likelihood equations, the maximum likelihood estimators (MLE's)
of B and $\Sigma$ are:

(2.5)  $\hat{B} = (X'X)^{-}X'Y, \qquad \hat\Sigma = Y'(I - X(X'X)^{-}X')Y/N$

Hence, the MLE of CBA is identical to the BLUE. $\hat\Sigma$ is the unique MLE of $\Sigma$; to
obtain an unbiased estimator, $\hat\Sigma$ is multiplied by $N/(N-r)$.
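The estimators in (2.2) and (2.5) can be sketched numerically as follows. This is an illustration only: the Moore-Penrose inverse from `numpy.linalg.pinv` is used as one convenient choice of generalized inverse $(X'X)^{-}$, and the two-group design and simulated data are assumptions made for the example.

```python
import numpy as np


def smm_estimates(X, Y):
    """Return (B_hat, Sigma_mle, Sigma_unbiased) for the model E(Y) = XB."""
    N = X.shape[0]
    r = np.linalg.matrix_rank(X)
    XtX_ginv = np.linalg.pinv(X.T @ X)           # one choice of (X'X)^-
    B_hat = XtX_ginv @ X.T @ Y                   # BLUE and MLE of B
    M = np.eye(N) - X @ XtX_ginv @ X.T           # residual projector
    Sigma_mle = Y.T @ M @ Y / N                  # MLE of Sigma
    Sigma_unbiased = Sigma_mle * N / (N - r)     # rescaled by N / (N - r)
    return B_hat, Sigma_mle, Sigma_unbiased


# Hypothetical data: two groups of five subjects, p = 3 measurements.
rng = np.random.default_rng(0)
X = np.kron(np.eye(2), np.ones((5, 1)))
Y = rng.normal(size=(10, 3))
B_hat, Sigma_mle, Sigma_unbiased = smm_estimates(X, Y)
```

With a one-way group design such as this one, the rows of `B_hat` are simply the group mean vectors.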

Hypothesis testing under (2.1), assuming normality, takes the general form

(2.6)  $H_0: CBA = \Gamma$

where C $(g \times m)$ is a known matrix of rank $g \le m$, A $(p \times u)$ is a known matrix of
rank $u \le p$, $\Gamma$ $(g \times u)$ is a known matrix (usually a zero matrix) and CBA is
estimable. Defining hypothesis and error sums of squares and products matrices of
the form

(2.7)  $Q_h = (C\hat{B}A - \Gamma)'(C(X'X)^{-}C')^{-1}(C\hat{B}A - \Gamma)$
       $Q_e = A'Y'(I - X(X'X)^{-}X')YA$

and

$\lambda_i = \mathrm{ch}_i(Q_hQ_e^{-1}), \quad i = 1, 2, \ldots, s$

where $s = \min(R(C), R(A))$, numerous test criteria have been proposed to test
$H_0$. They are:

Wilks' Lambda Criterion -

$\Lambda = \dfrac{|Q_e|}{|Q_h + Q_e|}$

Hotelling's Trace Criterion -

$\mathrm{Tr}(Q_hQ_e^{-1}) = \sum_{i=1}^{s}\lambda_i$

Roy's Largest Root Criterion -

$\mathrm{ch}_1(Q_hQ_e^{-1}) = \lambda_1$ or $\mathrm{ch}_1(Q_h(Q_h + Q_e)^{-1}) = \theta_1 = \dfrac{\lambda_1}{1 + \lambda_1}$

Pillai's Trace Criterion -

$\mathrm{Tr}(Q_h(Q_h + Q_e)^{-1}) = \sum_{i=1}^{s}\theta_i = \sum_{i=1}^{s}\dfrac{\lambda_i}{1 + \lambda_i}$

Another criterion, although not extensively tabled, is
Roy's Minimum Root Criterion - $\mathrm{ch}_s(Q_hQ_e^{-1}) = \lambda_s$

Although no one criterion is uniformly most powerful, the studies of 
Schatzoff (1966) and Olson (1974) show that under normality Roy's criterion is 
best for certain restrictive alternatives and that Wilks' criterion is best for 
a wide class of alternatives. The best robust criterion appears to be Pillai's 
trace criterion. 
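Since all of the criteria above are functions of the roots $\lambda_i$ of $Q_hQ_e^{-1}$, they can be computed together once the roots are available. The sketch below assumes the roots have already been obtained and uses the standard identity $\Lambda = \prod_i (1 + \lambda_i)^{-1}$ for Wilks' criterion; the function name and the example roots are hypothetical.

```python
import math


def manova_criteria(lams):
    """Compute the four usual test criteria from the roots of Q_h Q_e^{-1}."""
    thetas = [lam / (1.0 + lam) for lam in lams]   # roots of Q_h (Q_h + Q_e)^{-1}
    return {
        "wilks_lambda": math.prod(1.0 / (1.0 + lam) for lam in lams),
        "hotelling_trace": sum(lams),
        "roy_largest_root": max(thetas),
        "pillai_trace": sum(thetas),
    }


# Hypothetical roots with s = 2.
crit = manova_criteria([1.0, 0.25])
print(crit)
```

Small values of Wilks' criterion and large values of the other three are evidence against the null hypothesis.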

3. Potthoff and Roy Model 

While the SMM is applicable in many experimental situations, the model has
several limitations if an experimenter wants to analyze and fit growth curves
to data collected over time. To analyze data obtained from a growth curve
experiment, Potthoff and Roy (1964) developed the growth curve model (GCM), which
is a simple extension of the SMM.

The model considered by Potthoff and Roy is given by

(3.1)  $E(Y_0) = XBP, \qquad V(Y_0) = I_N \otimes \Sigma_0$

where $Y_0$ $(N \times q)$ is a data matrix, X $(N \times m)$ is a known design matrix, B $(m \times p)$
is a matrix of unknown nonrandom parameters, P $(p \times q)$ is a known matrix of full
rank $p \le q$, $\Sigma_0$ $(q \times q)$ is p.d. and the rows of $Y_0$ are independently normally
distributed.

Comparing the GCM with the SMM, we see that only the post matrix P has
been added to the model. This implies that each response variate can be
expressed as a linear regression model of the form

$E(y_i) = P'\beta_i$

where $y_i$ $(q \times 1)$ is the observation vector for the i-th subject and $\beta_i$ is a
vector of unknown parameters.

To analyze (3.1), Potthoff and Roy suggested the transformation

(3.2)  $Y = Y_0G^{-1}P'(PG^{-1}P')^{-1}$

where G $(q \times q)$ is any symmetric positive definite weight matrix, either
nonstochastic or independent of $Y_0$, such that $PG^{-1}P'$ is of full rank. Employing
the transformation in (3.2), the rows of the matrix Y $(N \times p)$ will be distributed
mutually independently normal with unknown p.d. variance-covariance matrix

$\Sigma = (PG^{-1}P')^{-1}PG^{-1}\,\Sigma_0\,G^{-1}P'(PG^{-1}P')^{-1}$   $(p \times p)$

and mean $E(Y) = XB$. Hence, by using (3.2) we have reduced the GCM to the SMM
with minor limitations on the selection of G.
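The reduction in (3.2) can be checked numerically. The following sketch applies the transformation to a noiseless mean matrix XBP and confirms that XB is recovered for two different weight matrices; the design, coefficients and the particular G used are hypothetical choices for illustration.

```python
import numpy as np


def potthoff_roy_transform(Y0, P, G):
    """Return Y = Y0 G^{-1} P' (P G^{-1} P')^{-1} as in (3.2)."""
    G_inv = np.linalg.inv(G)
    return Y0 @ G_inv @ P.T @ np.linalg.inv(P @ G_inv @ P.T)


# Hypothetical linear growth model: p = 2, q = 4, two groups of three.
times = np.array([1.0, 2.0, 3.0, 4.0])
P = np.vstack([times ** 0, times ** 1])
X = np.kron(np.eye(2), np.ones((3, 1)))
B = np.array([[10.0, 2.0], [8.0, 3.0]])
Y0_mean = X @ B @ P                         # E(Y_0) with no noise added

G = np.diag([1.0, 2.0, 3.0, 4.0])           # an arbitrary p.d. weight matrix
Y_from_G = potthoff_roy_transform(Y0_mean, P, G)
Y_from_I = potthoff_roy_transform(Y0_mean, P, np.eye(4))
```

Both transformed matrices equal XB, which illustrates that the choice of G does not affect the mean of Y; it affects only the variance-covariance matrix of the transformed rows.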

Motivation for the selection of the transformation in (3.2) by Potthoff
and Roy is contained in Appendix B of their (1964) paper; they show that the
BLUE of an estimable linear parametric function $\psi = c'Ba$ (where the
estimability conditions are that c belongs to the space spanned by $X'X$ and a belongs
to the space spanned by the columns of P) is given by

(3.3)  $\hat\psi = c'\hat{B}a, \qquad \hat{B} = (X'X)^{-}X'Y_0\Sigma_0^{-1}P'(P\Sigma_0^{-1}P')^{-1} .$

Since (3.1) reduces to (2.1) under (3.2), we see upon substituting Y in
(3.2) into (2.2) that

$\hat{B} = (X'X)^{-}X'Y_0G^{-1}P'(PG^{-1}P')^{-1},$

with G replacing $\Sigma_0$ in (3.3), is very close to the BLUE.
To test hypotheses of the form

(3.4)  $H_0: CBA = \Gamma$

under (3.1), we merely have to substitute Y defined in (3.2) into the expressions
for $Q_h$ and $Q_e$ in (2.7). The degrees of freedom for the hypothesis is $\nu_h = R(C) = g$
and the degrees of freedom for error is $\nu_e = N - R(X) = N - r$.

Setting $\Gamma = 0$ in (3.4) and letting Y be defined as in (3.2), the hypothesis
and error sums of squares and products matrices take the following form.

(3.5)  $Q_h = A'Y'X(X'X)^{-}C'(C(X'X)^{-}C')^{-1}C(X'X)^{-}X'YA$
       $Q_e = A'Y'(I - X(X'X)^{-}X')YA$

where $\nu_h = g$ and $\nu_e = N - r$.

Under the SMM, we said that no criterion is uniformly most powerful. This is
also the case for the GCM; however, in the GCM we have the additional problem of
selecting the weight matrix G when $p < q$. If $p = q$, the transformation in (3.2)
reduces to

$Y = Y_0P^{-1}$

or, if P is an orthogonal matrix so that $P^{-1} = P'$,

$Y = Y_0P'$

and there is no need to choose G. This was the approach taken by Bock (1963a)
and the one used in the development of the NYMBUL package (Bock, 1963b;
Finn, 1972). If $p < q$, the choice of G is important since it affects the variance
of $\hat\psi$, which increases as $G^{-1}$ departs from $\Sigma_0^{-1}$, the power of the tests and the
widths of confidence bands.

A simple choice of G is to set $G = I$. Then

$Y = Y_0P'(PP')^{-1}$

Such a choice of G will certainly simplify one's calculations; however, it is
not the best choice in terms of power since information is lost by reducing
$Y_0$ to Y unless G is set equal to $\Sigma_0$. The estimator of $\psi = c'Ba$, when it is
estimable and G is set equal to I, is the BLUE of $\psi$ assuming $\Sigma_0 = \sigma^2 I$.

4. Rao-Khatri Conditional Model

To try to avoid the arbitrary choice of the matrix G in Potthoff and Roy's
model and its effect on estimates and tests, Rao (1965, 1966, 1967, 1972) and
Khatri (1966) independently developed an alternative reduction of model (3.1) to
a conditional model

(4.1)  $E(Y \mid Z) = XB + Z\Gamma$

where Y $(N \times p)$ is a data matrix, X $(N \times m)$ is a known design matrix, B $(m \times p)$
is a matrix of unknown nonrandom parameters, Z $(N \times h)$ is a matrix of covariates
and $\Gamma$ $(h \times p)$ is a matrix of unknown regression coefficients.

To reduce (3.1) to (4.1), a $q \times q$ nonsingular matrix $H = (H_1, H_2)$ is constructed
so that the columns of $H_1$ form a basis for the vector space spanned by the rows
of P, $PH_1 = I$ and $PH_2 = 0$. When the rank of P is p, $H_1$ and $H_2$ can be selected as

$H_1 = G^{-1}P'(PG^{-1}P')^{-1}, \qquad H_2 = I - H_1P$

where G is an arbitrary positive definite matrix. Such a matrix H is not unique;
however, estimates and tests are invariant for all choices of H satisfying the
specified conditions (see Khatri, 1966). Hence, G in the expression for $H_1$ does
not affect estimates or tests under (4.1). By setting

(4.2)  $Y = Y_0H_1 = Y_0G^{-1}P'(PG^{-1}P')^{-1}$ and $Z = Y_0H_2$,

$E(Y) = XB$ and $E(Z) = 0$; thus, the expected value of Y given Z is seen to be of
the form specified in (4.1) (Khatri, 1966; Grizzle and Allen, 1969). Using
(4.1), the information contained in the covariates $Z = Y_0H_2$, which is ignored
in the Potthoff-Roy reduction, is utilized.
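A small numerical sketch may help make the construction of H concrete. Here $H_1$ follows the expression given above, while $H_2$ is taken as an orthonormal basis of the null space of P obtained from a singular value decomposition. That choice of $H_2$ is an assumption made for this example (it is one matrix satisfying $PH_2 = 0$ with $q - p$ columns), not necessarily the construction used by the cited authors.

```python
import numpy as np


def rao_khatri_H(P, G):
    """Return (H1, H2) with P H1 = I and P H2 = 0 for a full-rank p x q matrix P."""
    p = P.shape[0]
    G_inv = np.linalg.inv(G)
    H1 = G_inv @ P.T @ np.linalg.inv(P @ G_inv @ P.T)    # q x p
    # The last q - p right singular vectors of P span its null space.
    _, _, Vt = np.linalg.svd(P)
    H2 = Vt[p:].T                                        # q x (q - p)
    return H1, H2


# Hypothetical linear growth model with q = 4 time points and G = I.
times = np.array([1.0, 2.0, 3.0, 4.0])
P = np.vstack([times ** 0, times ** 1])
H1, H2 = rao_khatri_H(P, np.eye(4))
H = np.hstack([H1, H2])                                  # q x q

print(np.round(P @ H1, 10))   # identity matrix of order p
print(np.round(P @ H2, 10))   # zero matrix
```

Given a data matrix $Y_0$, the transformed responses and covariates would then be `Y0 @ H1` and `Y0 @ H2`.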



Both Rao and Khatri argued that the BLUE under the conditional model of
$\psi = c'Ba$ is more efficient than that obtained by Potthoff and Roy since their
estimator includes information in Z ignored by Potthoff and Roy. This is not
the case. As shown by Lee (1974) and Timm (1975), employing the standard
multivariate analysis of covariance (MANCOVA) model,

(4.3)  $\hat{B} = (X'X)^{-}X'Y_0S^{-1}P'(PS^{-1}P')^{-1}$

where $S = Y_0'(I - X(X'X)^{-}X')Y_0$. Khatri (1966), using the maximum likelihood
procedure, obtained the same result for $\hat{B}$. Thus, if $p < q$, Rao's procedure using q-p
covariates, Khatri's using maximum likelihood methods and Potthoff and Roy's method
weighting by $G^{-1} = S^{-1}$ are identical. Setting $G = I$ in the Potthoff and Roy method
is equivalent to not including any covariates in the Rao-Khatri reduction. When
$p = q$, $H_2$ does not exist; thus, the Rao-Khatri model is not applicable.



Testing the hypothesis

$H_0: CBA = \Gamma$

where $\Gamma = 0$, is not the same under the Potthoff and Roy and Rao-Khatri reductions.
Employing the standard MANCOVA model,

(4.4)  $Q_h = A'Y'X(X'X)^{-}C'(CRC')^{-1}C(X'X)^{-}X'YA$
       $Q_e = A'(PS^{-1}P')^{-1}A$

where

$R = (X'X)^{-} + (X'X)^{-}X'Y_0\,(S^{-1} - S^{-1}P'(PS^{-1}P')^{-1}PS^{-1})\,Y_0'X(X'X)^{-}$

$Y = Y_0S^{-1}P'(PS^{-1}P')^{-1}$

$\nu_h = g$, $\nu_e = N - r - h$ and $h = q - p$.

Although Potthoff and Roy's approach does not allow G to be stochastic unless
it is independent of $Y_0$, it is interesting to compare (3.5) and (4.4) if $G = S$.
Then

$Q_e = A'Y'(I - X(X'X)^{-}X')YA$
$\quad = A'(PS^{-1}P')^{-1}PS^{-1}\,Y_0'(I - X(X'X)^{-}X')Y_0\,S^{-1}P'(PS^{-1}P')^{-1}A$
$\quad = A'(PS^{-1}P')^{-1}(PS^{-1}P')(PS^{-1}P')^{-1}A$
$\quad = A'(PS^{-1}P')^{-1}A$

which, except for the degrees of freedom for error, is identical to $Q_e$ obtained
under the Rao-Khatri reduction. The sum of squares and products matrix $Q_h$,
however, is not the same.
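The algebraic identity for the error matrix when $G = S$ can be verified numerically. The sketch below uses simulated data, a two-group design and $A = I$, all of which are assumptions for illustration; it compares the error matrix computed from the transformed data with the closed form $A'(PS^{-1}P')^{-1}A$.

```python
import numpy as np

# Hypothetical data: N = 12 subjects in two groups, q = 4 time points, p = 2.
rng = np.random.default_rng(1)
N, q = 12, 4
times = np.arange(1.0, q + 1)
P = np.vstack([times ** 0, times ** 1])
X = np.kron(np.eye(2), np.ones((N // 2, 1)))
Y0 = rng.normal(size=(N, q))

M = np.eye(N) - X @ np.linalg.pinv(X.T @ X) @ X.T   # I - X (X'X)^- X'
S = Y0.T @ M @ Y0                                   # error SSP matrix of Y_0
S_inv = np.linalg.inv(S)
W = np.linalg.inv(P @ S_inv @ P.T)                  # (P S^{-1} P')^{-1}

Y = Y0 @ S_inv @ P.T @ W                            # transformation with G = S
A = np.eye(2)

Qe_transformed = A.T @ Y.T @ M @ Y @ A              # error matrix from transformed data
Qe_closed_form = A.T @ W @ A                        # closed form A'(P S^{-1} P')^{-1} A
print(np.allclose(Qe_transformed, Qe_closed_form))
```

The two matrices agree up to rounding error, so the two reductions differ in the error term only through the degrees of freedom attached to it.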

The development of the GCM by Potthoff and Roy and the subsequent Rao-Khatri
reduction has caused a great deal of confusion among experimenters trying to use
the model in growth curve studies. The first major paper which helped to clarify
and unify the methodologies was by Grizzle and Allen (1969). They also developed a
procedure for selecting only a subset of the q-p covariates.

5. Standard GCM 

Potthoff and Roy's analysis of the GCM was developed by introducing the
transformation

$Y = Y_0G^{-1}P'(PG^{-1}P')^{-1}$

to reduce the GCM to the SMM. To avoid having a test procedure that was dependent
on an arbitrary positive definite matrix G, Rao (1965) and Khatri (1966) proposed
an alternative reduction to the standard MANCOVA model which did not depend on G.
Their procedure, as discussed by Grizzle and Allen (1969), depends on selecting
the "best" set of q-p covariates. In addition, one may question the use of
covariates that are part of the transformed variables of the dependent variables
being analyzed. To avoid these problems, Tubbs, Lewis and Duran (1975) developed
a test procedure to test

$H_0: CBA = \Gamma$

employing maximum likelihood methods directly under the GCM.
Under the GCM, the maximum likelihood estimator of B is

(5.1)  $\hat{B} = (X'X)^{-}X'Y_0S^{-1}P'(PS^{-1}P')^{-1}$

and under $H_0: CBA = \Gamma$,

(5.2)  $\hat{B}_H = \hat{B} - (X'X)^{-}C'(C(X'X)^{-}C')^{-1}(C\hat{B}A - \Gamma)(A'(PS^{-1}P')^{-1}A)^{-1}A'(PS^{-1}P')^{-1}$

Using the likelihood ratio criterion due to Wilks, the hypothesis and error matrices are

(5.3)  $Q_h = (C\hat{B}A - \Gamma)'(C(X'X)^{-}C')^{-1}(C\hat{B}A - \Gamma)$
       $Q_e = A'(PS^{-1}P')^{-1}A$

where $\nu_h = g$ and $\nu_e = N - r$.

Comparing this result with that proposed by Rao and Khatri, we see that $Q_h$ for
each is different but has the same degrees of freedom, while $Q_e$ is identical
for both procedures but has different degrees of freedom. However, as pointed
out by Kleinbaum (1973), both procedures are asymptotically equivalent since they
have the same asymptotic Wishart distributions. No information is available about
the two procedures for small samples or about the relative power of each procedure.

6. Kleinbaum's Generalized GCM

In the analysis of growth curve data, observations at some time points are often
missing, either by chance or design, so that each dependent variate is not measured
on each subject. In addition, the design matrix X may not be the same for each
dependent variate. While these problems have been discussed in the literature
by Trawinski and Bargmann (1964), Srivastava and Roy (1965) and Srivastava (1966,
1967, 1968), extending the theory of the SMM, Kleinbaum (1973) developed a
generalized growth curve model (GGCM) for estimating and testing hypotheses when
observations are missing either by chance or design with different design matrices
corresponding to different response variates.

To develop the GGCM, we assume we have N subjects with observations taken at
q time points. Because of the incomplete data, the N subjects are divided into s
disjoint sets $S_1, S_2, \ldots, S_s$, where $S_i$ contains $N_i$ subjects. For $i = 1, 2, \ldots, s$,
measurements are obtained at $q_i \le q$ time points. Letting $\Theta_i$ $(q \times q_i)$ represent
an indicator matrix which specifies at which time points data in the set $S_i$ are
obtained, the GGCM is represented as

(6.1)  $E(Y_{0i}) = X_iBP\Theta_i, \qquad V(Y_{0i}) = I_{N_i} \otimes \Theta_i'\Sigma_0\Theta_i, \qquad i = 1, \ldots, s$

where $Y_{0i}$ $(N_i \times q_i)$ is the data matrix for the i-th set $S_i$, $X_i$ $(N_i \times m)$ is the design
matrix for set $S_i$, B $(m \times p)$ is the matrix of unknown parameters, P is a known
matrix of full rank p and $\Theta_i$ $(q \times q_i)$ is an indicator matrix of 0's and 1's for
the set $S_i$. Analogous to the GCM, we assume that $Y_{01}, \ldots, Y_{0s}$ are
independent and the rows of $Y_{0i}$ are normally distributed.
* oi 

Implicit in (6.1) is the fact that each subject in the experiment is from the
same family so that

$V(y_{ij}) = \Theta_i'\Sigma_0\Theta_i$

where $y_{ij}$ $(q_i \times 1)$ is the vector valued observation of the j-th subject from set $S_i$.

As discussed by Srivastava (1967) and more generally by Kleinbaum (1970),
obtaining BLUEs of every parametric function $\psi = c'Ba$ in complex multivariate
linear models (linear models with design matrices that are not the same for each
dependent variate) that are independent of the unknown elements of the
variance-covariance matrix requires additional restrictive conditions on the model (see,
e.g., Kleinbaum, 1970, p. 58). This led Kleinbaum (1973) to consider Best
Asymptotically Normal (BAN) estimators for the GGCM, which use consistent estimators
of $\Sigma_0$ and generally yield nonlinear estimators with variances that are, in large
samples, the minimum that could be achieved by linear estimators if $\Sigma_0$ were known.

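To obtain a BAN estimator for an estimable function $\psi = c'Ba$ in the GGCM,
(6.1) is conveniently represented in vector notation as

(6.2)  $E(y^*_{0i}) = (\Theta_i'P' \otimes X_i)\beta^*, \qquad V(y^*_{0i}) = \Theta_i'\Sigma_0\Theta_i \otimes I_{N_i}, \qquad i = 1, \ldots, s$

where $y^*_{0i}$ $(N_iq_i \times 1)$ is obtained from $Y_{0i}$ by rolling out $Y_{0i}$ columnwise and
$\beta^*$ $(mp \times 1)$ is the columnwise version of B. Applying the general univariate
least squares theory to (6.2), with $\hat\Sigma_0$ any consistent estimator of $\Sigma_0$, a BAN
estimator of $\psi$ is $\hat\psi = c'\hat{B}a$ where

$\hat\beta^* = \Bigl[\sum_{i=1}^{s} P\Theta_i(\Theta_i'\hat\Sigma_0\Theta_i)^{-1}\Theta_i'P' \otimes X_i'X_i\Bigr]^{-}\sum_{i=1}^{s}\bigl(P\Theta_i(\Theta_i'\hat\Sigma_0\Theta_i)^{-1} \otimes X_i'\bigr)y^*_{0i}$

and $\hat\beta^*$ is the columnwise representation of $\hat{B}$.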
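The passage from the matrix form of the model to its vector form rests on the identity vec(AXB) = (B' ⊗ A) vec(X) for columnwise stacking. The sketch below illustrates it for one set of subjects, assuming the set-i mean has the form $X_iBP\Theta_i$ with an indicator matrix (written `Theta_i` here) that selects the observed time points. The design, coefficients and missing-data pattern are hypothetical.

```python
import numpy as np


def indicator_matrix(q, observed):
    """Return the q x q_i matrix of 0's and 1's selecting the observed time indices."""
    Theta = np.zeros((q, len(observed)))
    for col, t in enumerate(observed):
        Theta[t, col] = 1.0
    return Theta


def vec(M):
    """Roll a matrix out columnwise into a vector."""
    return M.flatten(order="F")


# Hypothetical set S_i: three subjects observed at time points 1, 3 and 4 only.
times = np.array([1.0, 2.0, 3.0, 4.0])
P = np.vstack([times ** 0, times ** 1])            # p = 2, q = 4
B = np.array([[10.0, 2.0], [8.0, 3.0]])            # m = 2 groups
X_i = np.array([[1.0, 0.0], [0.0, 1.0], [0.0, 1.0]])
Theta_i = indicator_matrix(4, [0, 2, 3])           # second time point missing

mean_matrix = X_i @ B @ P @ Theta_i                # N_i x q_i matrix of means
design = np.kron(Theta_i.T @ P.T, X_i)             # (N_i q_i) x (m p) design matrix

print(np.allclose(vec(mean_matrix), design @ vec(B)))
```

Stacking the vectors and design matrices of all s sets then gives a single univariate linear model to which weighted least squares can be applied.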

To test the null hypothesis $H_0: CBA = \Gamma$ using the GGCM, Kleinbaum (1973)
proposes the Wald statistics

$W_N = (C\hat{B}A - \Gamma)'\Bigl(C\Bigl[\sum_{i=1}^{s} P\Theta_i(\Theta_i'\hat\Sigma_0\Theta_i)^{-1}\Theta_i'P' \otimes X_i'X_i\Bigr]^{-}C'\Bigr)^{-1}(C\hat{B}A - \Gamma)$

and

$W_N^* = (C\hat{B}A - \Gamma)'\Bigl(C\Bigl[\sum_{i=1}^{s} P\Theta_i(\Theta_i'\hat\Sigma_0\Theta_i)^{-1}\Theta_i'P' \otimes Q_i\Bigr]^{-}C'\Bigr)^{-1}(C\hat{B}A - \Gamma)$

where $Q_i = X_i'X_i - X_i'Y_{0i}F_i(F_i'Y_{0i}'Y_{0i}F_i)^{-1}F_i'Y_{0i}'X_i$

and $F_i$ is a column basis for $I - P_i'(P_iP_i')^{-}P_i$ so that $P_iF_i = 0$.

Comparing the Wald statistics with the test procedures proposed employing
the GCM, $W_N/N$ is equal to the LRT procedure and $W_N^*/N$ is equal to the Rao-Khatri
method using the Lawley-Hotelling Trace Criterion.
7. Summary

To test hypotheses of the form $H_0: CBA = 0$ assuming a GCM with $p < q$, three
approaches have been suggested to applied researchers over the past decade.



Potthoff and Roy -

Using the transformation $Y = Y_0G^{-1}P'(PG^{-1}P')^{-1}$ and forming the estimator
$\hat{B} = (X'X)^{-}X'Y$, the hypothesis and error matrices are formed:

$Q_h = A'Y'X(X'X)^{-}C'(C(X'X)^{-}C')^{-1}C(X'X)^{-}X'YA$
$Q_e = A'Y'(I - X(X'X)^{-}X')YA$

where $\nu_h = g = R(C)$, $\nu_e = N - r$ and G is any symmetric p.d. weight matrix, either
non-stochastic or independent of $Y_0$, such that $PG^{-1}P'$ is of full rank.



Tubbs, Lewis and Duran -

Using maximum likelihood procedures, which is equivalent to setting $G = S$
in the Potthoff and Roy model, they obtain

$\hat{B} = (X'X)^{-}X'Y_0S^{-1}P'(PS^{-1}P')^{-1} = (X'X)^{-}X'Y$
$Q_h = A'Y'X(X'X)^{-}C'(C(X'X)^{-}C')^{-1}C(X'X)^{-}X'YA$
$Q_e = A'Y'(I - X(X'X)^{-}X')YA = A'(PS^{-1}P')^{-1}A$

where $\nu_h = g = R(C)$, $\nu_e = N - r$, $S = Y_0'(I - X(X'X)^{-}X')Y_0$ and $Y = Y_0S^{-1}P'(PS^{-1}P')^{-1}$.

Rao-Khatri -

Using a conditional model with

$\hat{B} = (X'X)^{-}X'Y_0S^{-1}P'(PS^{-1}P')^{-1} = (X'X)^{-}X'Y$
$Y = Y_0S^{-1}P'(PS^{-1}P')^{-1}$
$S = Y_0'(I - X(X'X)^{-}X')Y_0$

the matrices

$Q_h = A'Y'X(X'X)^{-}C'(CRC')^{-1}C(X'X)^{-}X'YA$
$Q_e = A'(PS^{-1}P')^{-1}A$
$R = (X'X)^{-} + (X'X)^{-}X'Y_0\,(S^{-1} - S^{-1}P'(PS^{-1}P')^{-1}PS^{-1})\,Y_0'X(X'X)^{-}$

are formed, where $\nu_h = g = R(C)$ and $\nu_e = N - r - q + p$.

While the procedure set forth by Khatri has been "accepted" as the usual
procedure employed in growth curve studies over the years and is asymptotically
equivalent to the procedure proposed by Tubbs, Lewis and Duran, we do not know
which of the procedures is best in small samples. Perhaps the determination
cannot be made on the basis of power, but on whether in assessing growth
the notion of conditional versus unconditional inference is being raised (Bock,
1975).

While the work of Kleinbaum has begun to address the data problems we have
in analyzing data in the behavioral sciences, his procedure may lead to spurious
test statistics since it depends on the method used to estimate $\Sigma_0$ in the
construction of the BAN estimator.